This survey paper is not a complete reference guide to number-theoretical
applications of ergodic theory. Instead, the plan is to consider an approach to a class
of problems involving Diophantine properties of n-tuples of real numbers, namely, to
describe a specific dynamical system which is naturally connected with these
problems.

1. A glimpse at Diophantine approximation 

For motivation, let us start by looking at two (vaguely defined) Diophantine 
problems: 

Problem 1. Given a nondegenerate indefinite quadratic form of signature $(m,n)$,
study the set of its values at integer points.

Here is a precise statement along these lines, conjectured in 1929 by Oppenheim
[Op1] and proved in 1986 by Margulis [Ma3]:

Theorem 1.1. Let $B$ be a real nondegenerate indefinite quadratic form of signature
$(m,n)$, $k = m + n > 2$. Then either $B$ is proportional to a rational form, or
$\inf_{x \in \mathbb{Z}^k \setminus \{0\}} |B(x)| = 0$.

One possible approach to the problem is to write $B(x) = \lambda S_{m,n}(gx)$, where
$\lambda \in \mathbb{R}$, $g \in SL_k(\mathbb{R})$ and

$S_{m,n}(x_1, \dots, x_k) = x_1^2 + \dots + x_m^2 - x_{m+1}^2 - \dots - x_k^2$

(a linear unimodular change of variables). Then the problem reduces to studying
values of the standard form $S_{m,n}$ of signature $(m,n)$ applied to the collection of
vectors of the form $\{gx \mid x \in \mathbb{Z}^k\}$. The dynamical approach then consists of studying
the action of the stabilizer of the form $S_{m,n}$ on such collections.
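
As a purely numerical illustration of Theorem 1.1 (not part of any proof), one can search a box of integer points for small values of $|B(x)|$. The sketch below is a brute-force experiment under stated assumptions: the two sample forms and the helper name `min_abs_value` are hypothetical choices made here, both forms have signature (2,1), and floating-point arithmetic is used throughout.

```python
import itertools
import math

def min_abs_value(form, box):
    """Smallest |form(x)| over nonzero integer vectors x in [-box, box]^3."""
    best = math.inf
    rng = range(-box, box + 1)
    for x in itertools.product(rng, repeat=3):
        if any(x):  # skip the zero vector
            best = min(best, abs(form(*x)))
    return best

# Hypothetical examples: an irrational and a rational form of signature (2, 1).
irrational = lambda x, y, z: x * x + y * y - math.sqrt(2) * z * z
rational = lambda x, y, z: x * x + y * y - 3 * z * z

for box in (2, 5, 10):
    print(box, min_abs_value(irrational, box), min_abs_value(rational, box))
```

For the rational form the minimum stays equal to 1, since $x^2 + y^2 = 3z^2$ has no nonzero integer solutions, while for the irrational form the minimum shrinks as the box grows, in agreement with the theorem.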

Problem 2. Given $m$ vectors $y_1, \dots, y_m \in \mathbb{R}^n$ (viewed as linear forms $x \mapsto y_i \cdot x$,
$x \in \mathbb{R}^n$), how small (simultaneously) can be the values of $|y_i \cdot q + p_i|$, $p_i \in \mathbb{Z}$, when
$q = (q_1, \dots, q_n) \in \mathbb{Z}^n$ is far from $0$?

Let us also illustrate this by a conjecture, this time still open. Here we specialize
to the case of just one linear form given by $y \in \mathbb{R}^n$. The following is known as
Littlewood's (1930) Conjecture:

Conjecture 1.2. For every $y \in \mathbb{R}^n$, $n \ge 2$, one has

(1.1) $\inf_{q \in \mathbb{Z}^n \setminus \{0\},\ p \in \mathbb{Z}} |y \cdot q + p| \cdot \Pi_+(q) = 0$,

where $\Pi_+(q)$ is defined to be equal to $\prod_{i=1}^n \max(|q_i|, 1)$ or, equivalently, $\prod_{q_i \ne 0} |q_i|$.

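To approach Problem 2, one can put together

$y_1 \cdot q + p_1, \dots, y_m \cdot q + p_m$ and $q_1, \dots, q_n$,

and consider the lattice

(1.2) $\left\{ \begin{pmatrix} Yq + p \\ q \end{pmatrix} \,\middle|\, p \in \mathbb{Z}^m,\ q \in \mathbb{Z}^n \right\} = L_Y \mathbb{Z}^{m+n}$,

where

(1.3) $L_Y \stackrel{\mathrm{def}}{=} \begin{pmatrix} I_m & Y \\ 0 & I_n \end{pmatrix}$

and $Y$ is the matrix with rows $y_1^T, \dots, y_m^T$. In this case, the orbit of the lattice
(1.2) under a certain group action provides a way to study Diophantine properties
of $y_1, \dots, y_m$.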
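
The identity behind (1.2) can be checked mechanically. The following minimal sketch, with hypothetical helper names `L_Y` and `apply` and an arbitrarily chosen rational matrix $Y$ (so that exact arithmetic is available), builds the block matrix of (1.3) and applies it to an integer vector $(p, q)$; the first $m$ coordinates of the result are $Yq + p$ and the last $n$ are $q$.

```python
from fractions import Fraction

def L_Y(Y):
    """Block matrix [[I_m, Y], [0, I_n]] for an m x n matrix Y given as a list of rows."""
    m, n = len(Y), len(Y[0])
    k = m + n
    M = [[Fraction(int(i == j)) for j in range(k)] for i in range(k)]
    for i in range(m):
        for j in range(n):
            M[i][m + j] = Fraction(Y[i][j])
    return M

def apply(M, v):
    """Matrix-vector product."""
    return [sum(a * b for a, b in zip(row, v)) for row in M]

# Illustrative data (an assumption of this sketch): m = 1, n = 2.
Y = [[Fraction(1, 3), Fraction(2, 7)]]
p, q = [5], [3, -7]
print(apply(L_Y(Y), p + q))  # first entry is Y q + p, the rest is q
```

Since $L_Y$ is upper triangular with ones on the diagonal, its determinant is 1, so the lattice $L_Y\mathbb{Z}^{m+n}$ has covolume 1.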

In both cases, we use the initial data of a number-theoretic problem to construct 
a lattice in a Euclidean space, and then work with the collection of all such objects 
(lattices). Our goal is to describe several principles responsible for a particular 
class of applications of flows in the space of lattices to number theory. For more 
details and a broader picture the reader is referred to a number of extensive reviews 
of homogeneous actions and interactions with number theory which have appeared 
during the last 10 years, such as: ICM talks of Margulis [Ma5], Ratner [Ra3], Dani
[D5] and Eskin [E], books [St2] and [BMa], and survey papers [D6, D7, KSS, Ma7,
St1].

The structure of this paper is as follows: in the next section we collect all basic 
facts about the space of lattices, and discuss a "lattice" approach to studying values 
of quadratic forms at integer points (Problem 1 and Theorem 1.1 in particular). 
Then in §§ 3 and 4 we take several sub-problems of Problem 2 and describe recent 
results obtained by means of homogeneous dynamics. The last section is devoted
to Conjecture 1.2 and related issues, that is, so-called "multiplicative Diophantine
approximation".

2. The space of lattices 
Phase space. Fix $k \in \mathbb{N}$ and consider

$\Omega \stackrel{\mathrm{def}}{=}$ the set of unimodular lattices in $\mathbb{R}^k$

(discrete subgroups with covolume 1). That is, any lattice $\Lambda \in \Omega$ is equal to
$\mathbb{Z}v_1 \oplus \dots \oplus \mathbb{Z}v_k$, where the set $\{v_1, \dots, v_k\}$ (called a generating set of the lattice)
is linearly independent, and the volume of the parallelepiped spanned by $v_1, \dots, v_k$
is equal to 1.

An element of $\Omega$ which is easy to distinguish is $\mathbb{Z}^k$ (the standard lattice). In
fact, any $\Lambda \in \Omega$ is equal to $g\mathbb{Z}^k$ for some $g \in G \stackrel{\mathrm{def}}{=} SL_k(\mathbb{R})$. That is, $G$ acts
transitively on $\Omega$, and, further, $\Gamma \stackrel{\mathrm{def}}{=} SL_k(\mathbb{Z})$ is the stabilizer of $\mathbb{Z}^k$. In other words,
$\Omega$ is isomorphic to the homogeneous space $G/\Gamma$.

Measure. One can consider a Haar measure on $G$ (both left and right invariant)
and the corresponding left-invariant measure on $\Omega$. It is well known that the
resulting measure happens to be finite. We denote by $\mu$ the normalized Haar measure
on $\Omega$.

Topology. Two lattices are said to be close if generating sets which are close to
each other can be chosen for them. This defines a topology on $\Omega$ which coincides
with the quotient topology on $G/\Gamma$. An important feature is that $\Omega$ is not compact
(in other words, $\Gamma$ is a non-uniform lattice in $G$). More precisely, one has

Theorem 2.1 (Mahler's Compactness Criterion, see [R]). A subset $K$ of $\Omega$ is
bounded iff there exists $\varepsilon > 0$ such that for any $\Lambda \in K$ one has $\inf_{x \in \Lambda \setminus \{0\}} \|x\| \ge \varepsilon$.
In other words, define

$\Omega_\varepsilon \stackrel{\mathrm{def}}{=} \{\Lambda \in \Omega \mid \|x\| < \varepsilon$ for some $x \in \Lambda \setminus \{0\}\}$;
then $\Omega \setminus \Omega_\varepsilon$ is compact.
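
To make the criterion concrete for $k = 2$, one can compute the length of a shortest nonzero vector of a planar lattice by Lagrange-Gauss reduction. The code below is a sketch under stated assumptions: the function name `shortest_vector_2d` is introduced here for illustration, floating-point arithmetic is used, and the sample lattices are the images of $\mathbb{Z}^2$ under $\mathrm{diag}(e^t, e^{-t})$.

```python
import math

def shortest_vector_2d(u, v):
    """Lagrange-Gauss reduction: a shortest nonzero vector of the lattice Zu + Zv."""
    def norm2(w):
        return w[0] * w[0] + w[1] * w[1]
    if norm2(u) > norm2(v):
        u, v = v, u
    while True:
        # subtract from v the integer multiple of u closest to its projection
        m = round((u[0] * v[0] + u[1] * v[1]) / norm2(u))
        v = (v[0] - m * u[0], v[1] - m * u[1])
        if norm2(v) >= norm2(u):
            return u
        u, v = v, u

for t in (0, 1, 2, 4):
    u, v = (math.exp(t), 0.0), (0.0, math.exp(-t))
    print(t, math.hypot(*shortest_vector_2d(u, v)))
```

The printed lengths equal $e^{-t}$ and tend to zero, so by Theorem 2.1 this family of lattices is not contained in any compact subset of $\Omega$.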

Action. $\Omega$ is a topological $G$-space, with the (continuous) left action defined by

$g\Lambda = \{gx \mid x \in \Lambda\}$ or $g(h\Gamma) = (gh)\Gamma$.

One can consider the action of various subgroups (one- or multi-parameter) or
subsets of $G$. Thus one gets an interesting class of dynamical systems. Several
important features of these systems are worth mentioning.

First, the geometry of the phase space is "uniform": a small enough neighborhood
of every point of $\Omega$ is isometric to a neighborhood of identity in $G$. In other
words, many geometric constructions can be reduced to algebraic manipulations in
$G$.

Second, the very rich representation theory of $G$ can be heavily used. Namely,
the $G$-action on $\Omega$ can be studied via the regular representation of $G$ on $L^2(\Omega)$.

The two features above in fact apply to all homogeneous actions, that is, actions
of subgroups of a Lie group $G$ on the quotient space $G/\Gamma$ where $\Gamma$ is a lattice in $G$.
There are also important features specific to the space $\Omega = SL_k(\mathbb{R})/SL_k(\mathbb{Z})$:
namely, combinatorial structure of the space of lattices, as well as intuition coming
from the theory of Diophantine approximation.

In what follows we will focus our attention on the space $\Omega$, but most of the
results will be valid in the much greater generality of homogeneous actions, which will
be indicated. The reader is referred to [AGH, Ma6, R, St2, Z] for general facts
about Lie groups, discrete subgroups and homogeneous spaces.

Classification of actions. Let $G$ be a Lie group and $\Gamma$ a discrete subgroup.
Since $g(h\Lambda) = (ghg^{-1})g\Lambda$ for every $\Lambda \in G/\Gamma$ and $g,h \in G$, local properties of
the $g$-action are determined by the differential of the conjugation map,
$\mathrm{Ad}_g(x) = \frac{d}{dt}\big(g\exp(tx)g^{-1}\big)\big|_{t=0}$ (here $x$ belongs to the Lie algebra of $G$). An element $g \in G$ is said
to be: unipotent if $(\mathrm{Ad}_g - \mathrm{Id})^j = 0$ for some $j \in \mathbb{N}$ (equivalently, all eigenvalues of
$\mathrm{Ad}_g$ are equal to 1); quasi-unipotent if all eigenvalues of $\mathrm{Ad}_g$ are of absolute value
1; partially hyperbolic if it is not quasi-unipotent.
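
For $G = SL_2(\mathbb{R})$ the trichotomy can be read off from the trace: if $g$ has eigenvalues $\lambda, \lambda^{-1}$, then $\mathrm{Ad}_g$ has eigenvalues $\lambda^2, 1, \lambda^{-2}$. The sketch below implements this special case only; the function name `classify_sl2` is hypothetical, and exact inputs (integers or `Fraction`s) are assumed so that the comparison with 2 is reliable.

```python
from fractions import Fraction

def classify_sl2(a, b, c, d):
    """Classify g = [[a, b], [c, d]] in SL_2(R) by the eigenvalues of Ad g.

    Returns the most specific label; a unipotent element is also quasi-unipotent.
    """
    if a * d - b * c != 1:
        raise ValueError("matrix must have determinant 1")
    tr = abs(a + d)
    if tr == 2:
        return "unipotent"            # eigenvalues of g are +1 or -1, so Ad g has only 1
    if tr < 2:
        return "quasi-unipotent"      # eigenvalues of g lie on the unit circle
    return "partially hyperbolic"     # Ad g has an eigenvalue off the unit circle

print(classify_sl2(1, 1, 0, 1))
print(classify_sl2(0, -1, 1, 0))
print(classify_sl2(2, 0, 0, Fraction(1, 2)))
```
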
Given $g \in G$, define

$H^{\pm}(g) \stackrel{\mathrm{def}}{=} \{h \in G \mid g^{-l} h g^{l} \to e$ as $l \to \pm\infty\}$

(expanding and contracting horospherical subgroups). Then $G$ is locally a direct
product of $H^-(g)$, $H^+(g)$ and another subgroup $H^0(g)$, and $g$ is quasi-unipotent iff
$H^0(g) = G$ (that is, $H^-(g)$ and $H^+(g)$ are trivial). Furthermore, for any $\Lambda \in G/\Gamma$
the orbits $H^-(g)\Lambda$, $H^+(g)\Lambda$ and $H^0(g)\Lambda$ are leaves of stable, unstable and neutral
foliations on $G/\Gamma$.

We now specialize to the case $G = SL_k(\mathbb{R})$ and $\Gamma = SL_k(\mathbb{Z})$.

Example. The simplest case is when $k = 2$: then $\Omega = G/\Gamma$ is isomorphic to the
unit tangent bundle to the surface $\mathbb{H}^2/SL_2(\mathbb{Z})$. The geodesic flow is then given by
the action of $\begin{pmatrix} e^{t} & 0 \\ 0 & e^{-t} \end{pmatrix}$, and the horocycle flow by the action of $\begin{pmatrix} 1 & t \\ 0 & 1 \end{pmatrix}$ (the
simplest example of a unipotent flow).

More examples. Suppose that $g \in G$ is diagonalizable over $\mathbb{R}$, and take a basis of
$\mathbb{R}^k$ in which

$g = \mathrm{diag}(\underbrace{\lambda_1, \dots, \lambda_1}_{i_1 \text{ times}}, \dots, \underbrace{\lambda_l, \dots, \lambda_l}_{i_l \text{ times}})$, $\lambda_1 > \dots > \lambda_l$.

Then $H^-(g)$ and $H^+(g)$ are subgroups of lower- and upper-triangular groups. An important
special case occurs when $g$ as above comes from "the most singular" direction in
a Weyl chamber of the Lie algebra of $G$; that is, when it has only two distinct
eigenvalues. In this case one can write $k = m + n$ and consider a one-parameter
subgroup of $G = SL_k(\mathbb{R})$ given by

(2.1) $g_t = \mathrm{diag}(e^{t/m}, \dots, e^{t/m}, e^{-t/n}, \dots, e^{-t/n})$.

Then the expanding horospherical subgroup of $G$ relative to $g_1$ is exactly
$\{L_Y \mid Y \in M_{m \times n}(\mathbb{R})\}$, where $L_Y$ is as defined in (1.3).

Ergodic properties. Here the main tool is the representation theory of semisimple
Lie groups. By a theorem of Moore [Mo1], the action of any noncompact closed
subgroup of $G$ on $\Omega = G/\Gamma$ is ergodic and, moreover, mixing; in other words,
matrix coefficients $(g\varphi, \psi)$ of square-integrable functions on $\Omega$ with mean value zero
tend to $0$ as $g \to \infty$ in $G$. (Here $(\cdot, \cdot)$ stands for the inner product in $L^2(\Omega)$.) In
fact for smooth functions this decay is exponential, as shown in the following

Theorem 2.2 (Decay of correlations). There exists $\beta > 0$ such that for any
two functions $\varphi, \psi \in C^\infty_{\mathrm{comp}}(G/\Gamma)$ with $\int \varphi\, d\mu = \int \psi\, d\mu = 0$ and any $g \in G$ one has

$\left| \int (g\varphi \cdot \psi)\, d\mu \right| \le \mathrm{const}(\varphi, \psi)\, e^{-\beta \|g\|}$.

In particular, if $g_t$ is partially hyperbolic, then

(2.2) $\left| \int (g_t\varphi \cdot \psi)\, d\mu \right| \le \mathrm{const}(\varphi, \psi, g_t)\, e^{-\gamma t}$ for some $\gamma > 0$.

See [Mo2, Ra1] for $k = 2$, [KS] for $k > 2$.

The following result can be derived from the mixing property of partially hyperbolic
actions on $G/\Gamma$:

Theorem 2.3 (Uniform distribution of unstable leaves, [KM1]). Let $g_t$ be a
partially hyperbolic one-parameter subgroup of $G$, $H = H^+(g_1)$, $\nu$ a Haar measure on
$H$. Then for any open subset $V$ of $H$, any $\varphi \in C^\infty_{\mathrm{comp}}(G/\Gamma)$ and any compact subset
$Q$ of $G/\Gamma$, the average of $\varphi$ over the $g_t$-image of $V\Lambda$, $\Lambda \in G/\Gamma$, tends to the integral
of $\varphi$ as $t \to \infty$ uniformly (in $\Lambda$) on compact subsets of $G/\Gamma$; that is,

(2.3) $\frac{1}{\nu(g_t V g_{-t})} \int_{g_t V g_{-t}} \varphi(h g_t \Lambda)\, d\nu(h) \to \int_{G/\Gamma} \varphi\, d\mu$.

Remarks. Moore's theorem (that is, a criterion for mixing of subgroup actions) was
proved under the assumption that

(2.4) $G$ is a connected semisimple Lie group with finite center and no compact factors, and $\Gamma$ is an irreducible lattice in $G$.

(A lattice is irreducible if it is not, up to commensurability, a product of lattices in
simple factors of $G$.) See also [BM, Ma5] for more general ergodicity and mixing
criteria. Theorem 2.2 was proved in [KS] assuming (2.4) and in addition that

(2.5) all simple factors of $G$ have property (T).

One also knows, see [Bek, Lemma 3], that Theorem 2.2 holds when the group $G$
is simple. In [KM3] it was shown that one can remove condition (2.5) but instead
assume that $\Gamma \subset G$ is a non-uniform lattice.

Theorem 2.3 is also proven in the generality of the assumption (2.4) (in fact,
one only needs mixing of the $g_t$-action), and as long as Theorem 2.2 holds, the
convergence in (2.3) is exponential in $t$. See [KM1, K4] for more details and
generalizations, and [Ma7, Remark 3.10] for references to other related results and
methods.

Recurrence of unipotent trajectories. In this subsection we are back to the
case $G = SL_k(\mathbb{R})$ and $\Gamma = SL_k(\mathbb{Z})$. It is an elementary geometric observation that
horocyclic trajectories on $SL_2(\mathbb{R})/SL_2(\mathbb{Z})$ do not run off to infinity. It is much
harder to prove that the same holds for any unipotent flow on $\Omega = SL_k(\mathbb{R})/SL_k(\mathbb{Z})$
for $k \ge 3$ [Ma2, D3]. The theorem below, due to Dani (1985), is a quantitative
strengthening; it shows that for any unipotent orbit one can find a compact subset
of $\Omega$ such that the density of time that the orbit spends in this set is as close to 1
as one wishes:

Theorem 2.4 [D3]. For any $\Lambda \in \Omega$ and any $\delta > 0$ there exists $\varepsilon > 0$ such that for
any unipotent subgroup $\{u_x \mid x \in \mathbb{R}\}$ of $G$ and any $T > 0$ one has

$|\{x \in [0,T] \mid u_x\Lambda \in \Omega_\varepsilon\}| \le \delta T$.

The proof is based on the combinatorial structure of the space of lattices. We
will obtain the theorem above as a corollary of a more general fact in §4.

Orbit closures of unipotent flows. It has been proved by Hedlund that any
orbit of the horocycle flow on $SL_2(\mathbb{R})/SL_2(\mathbb{Z})$ is either periodic or dense. A
far-reaching generalization has been conjectured by Raghunathan and proved in full
generality by Ratner. In particular, one has the following






DMITRY KLEINBOCK 



Theorem 2.5. [Ra2] Let $G$ be a connected Lie group, $\Gamma$ a lattice in $G$, and let
$U$ be a subgroup of $G$ generated by unipotent one-parameter subgroups. Then for
any $x \in G/\Gamma$ there exists a closed subgroup $L$ containing $U$ such that the closure
of the orbit $Ux$ coincides with $Lx$ and there is an $L$-invariant probability measure
supported on $Lx$.

We emphasize that Theorem 2.5 allows one to understand all (not just almost
all) orbits. It is this feature which is responsible for applications of this theorem to
number theory. Let us illustrate it by sketching the reduction of the Oppenheim
Conjecture (Theorem 1.1) to Ratner's theorem.

Corollary 2.6. Let $S(x_1, x_2, x_3) = 2x_1x_3 - x_2^2$, and

$H_S = \{h \in SL_3(\mathbb{R}) \mid S(hx) = S(x)\ \forall x \in \mathbb{R}^3\} \cong SO(2,1)$

(the stabilizer of $S$). Then any relatively compact orbit $H_S\Lambda$, $\Lambda$ a lattice in $\mathbb{R}^3$, is
compact.

Proof. $H_S$ is generated by its unipotent one-parameter subgroups, namely

$u(t) = \begin{pmatrix} 1 & t & t^2/2 \\ 0 & 1 & t \\ 0 & 0 & 1 \end{pmatrix}$ and $u^T(t) = \begin{pmatrix} 1 & 0 & 0 \\ t & 1 & 0 \\ t^2/2 & t & 1 \end{pmatrix}$,

and there are no intermediate subgroups between $H_S$ and $SL_3(\mathbb{R})$. Hence by
Theorem 2.5 any $H_S$-orbit is either closed or dense. □
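
That $u(t)$ and $u^T(t)$ indeed lie in the stabilizer $H_S$ is a direct computation, which can also be confirmed in exact rational arithmetic. The following short sketch does so for sample values of $t$ and $x$; the helper names `u`, `transpose` and `apply` and the sample data are assumptions of this illustration.

```python
from fractions import Fraction

def S(x):
    """The quadratic form S(x1, x2, x3) = 2 x1 x3 - x2^2."""
    x1, x2, x3 = x
    return 2 * x1 * x3 - x2 * x2

def u(t):
    """Upper-triangular unipotent matrix u(t)."""
    return [[1, t, t * t / 2], [0, 1, t], [0, 0, 1]]

def transpose(M):
    return [list(col) for col in zip(*M)]

def apply(M, x):
    """Matrix-vector product."""
    return [sum(a * b for a, b in zip(row, x)) for row in M]

t = Fraction(3, 7)
x = [Fraction(2), Fraction(-5), Fraction(11, 3)]
# all three printed values coincide: S is invariant under u(t) and its transpose
print(S(x), S(apply(u(t), x)), S(apply(transpose(u(t)), x)))
```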

The following is crucial for the deduction of the Oppenheim conjecture from the
above corollary (implicitly stated in [CS] and later observed by Raghunathan):

Lemma 2.7. Let $B$ be a real nondegenerate indefinite quadratic form in 3 variables.
Write $B(x) = \lambda S(gx)$ for some $g \in SL_3(\mathbb{R})$. Then the orbit $H_S g\mathbb{Z}^3$ is relatively
compact if and only if

(2.6) $|B(x)| \ge \varepsilon$ for some $\varepsilon > 0$ and all $x \in \mathbb{Z}^3 \setminus \{0\}$.

Proof. By transitivity of the action of the stabilizer $H_B$ of the form $B$ on the level
sets of $B$ in $\mathbb{R}^3 \setminus \{0\}$ and by continuity of $B$ at zero, assertion (2.6) is equivalent
to the norm of $hx$ being not less than $\varepsilon$ for some $\varepsilon > 0$ and all $x \in \mathbb{Z}^3 \setminus \{0\}$ and
$h \in H_B$. The latter, in view of Theorem 2.1, is equivalent to the orbit $H_B\mathbb{Z}^3$ being
relatively compact in $\Omega = SL_3(\mathbb{R})/SL_3(\mathbb{Z})$. But $H_B = g^{-1}H_S g$, therefore the orbit
$H_B\mathbb{Z}^3$ is relatively compact if and only if so is the orbit $H_S g\mathbb{Z}^3$. □

Corollary 2.8. Let $B$ be a real nondegenerate indefinite quadratic form in 3
variables. If (2.6) holds, then $B$ is proportional to a rational form.

Proof. The previous corollary implies that $H_B\mathbb{Z}^3$ is compact; but since this orbit
can be identified with $H_B/(H_B \cap SL_3(\mathbb{Z}))$, this shows that $H_B \cap SL_3(\mathbb{Z})$ is Zariski
dense in $H_B$, which is equivalent to $H_B$ being defined over $\mathbb{Q}$, hence the claim. □

To derive Theorem 1.1 from the above corollary, one can then observe that if $B$
is a real irrational nondegenerate indefinite quadratic form in $k$ variables and $l < k$
then $\mathbb{R}^k$ contains a rational subspace $L$ of dimension $l$ such that the restriction of $B$
to $L$ is irrational nondegenerate and indefinite (the proof can be found in [DM1]).
Hence the validity of Theorem 1.1 in the case $k = 3$ implies the general case.

It is worthwhile to mention that in the paper [Op2] Oppenheim modified his
conjecture replacing the claim $\inf_{x \in \mathbb{Z}^k \setminus \{0\}} |B(x)| = 0$ with "0 is a non-isolated
accumulation point of $B(\mathbb{Z}^k)$", which he showed to be equivalent to the density
of $B(\mathbb{Z}^k)$ in $\mathbb{R}$. This stronger form of the conjecture was also proved by Margulis
[Ma4]. It is not difficult to derive it from Theorem 2.5, first reducing to the case
$k = 3$: if $B$ is not proportional to a rational form, the orbit $H_B\mathbb{Z}^3$ is not closed,
hence (by non-existence of intermediate subgroups between $H_B$ and $SL_3(\mathbb{R})$) it is
dense in $\Omega$, and the density of $B(\mathbb{Z}^3)$ in $\mathbb{R}$ follows.

Finally let us briefly mention quantitative extensions of the above results. For
$B$ as above, an open interval $I \subset \mathbb{R}$ and a positive $T$ one defines

$V_{I,B}(T) \stackrel{\mathrm{def}}{=} \{x \in \mathbb{R}^k \mid B(x) \in I,\ \|x\| \le T\}$

and

$N_{I,B}(T) \stackrel{\mathrm{def}}{=} \#(\mathbb{Z}^k \cap V_{I,B}(T))$;

then one has $N_{I,B}(T) \to \infty$ as $T \to \infty$ for every nonempty $I \subset \mathbb{R}$. The next
theorem, a compilation of results from [DM2] and [EMM], describes the growth of
this counting function comparing it to the volume of $V_{I,B}(T)$.

Theorem 2.9. Let $B$ be a real nondegenerate indefinite quadratic form of signature
$(m,n)$, $m + n > 2$, $m \ge n$, which is not proportional to a rational form, and let
$I \subset \mathbb{R}$ be a nonempty open interval. Then

(a) [DM2] $\liminf_{T \to \infty} \frac{N_{I,B}(T)}{|V_{I,B}(T)|} \ge 1$;

(b) [EMM] if $m > 2$, then $\lim_{T \to \infty} \frac{N_{I,B}(T)}{|V_{I,B}(T)|} = 1$.

In the exceptional cases, i.e. for forms of signature (2,1) and (2,2), there are
counterexamples showing that $N_{I,B}(T)$ can grow like $\mathrm{const} \cdot |V_{I,B}(T)| (\log T)^{1-\varepsilon}$, and
it is proved in [EMM] that $\mathrm{const} \cdot |V_{I,B}(T)| \log T$ is an asymptotically exact upper
bound for $N_{I,B}(T)$.

For both parts the crucial step is to approximate the counting function $N_{I,B}(T)$
by values of integrals of certain functions along orbits in the space of lattices. Part
(a) relies upon Ratner's uniform distribution theorem (a refinement of Theorem
2.5 for one-parameter unipotent subgroups), while the second part involves delicate
estimates based on combinatorics of lattices. See [DM1, DM2, EMM, Ma7] for
more details on the proofs and further refinements and generalizations.
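
The counting function itself is easy to evaluate by brute force for small $T$. The sketch below is only an experiment under stated assumptions: the name `count_points`, the two sample forms, the interval and the choice of the Euclidean norm are all made here for illustration, and no attempt is made to compute the volume $|V_{I,B}(T)|$.

```python
import itertools
import math

def count_points(form, interval, T, k=3):
    """Number of integer points x in Z^k with form(x) in the open interval and |x| <= T."""
    lo, hi = interval
    R = int(math.floor(T))
    count = 0
    for x in itertools.product(range(-R, R + 1), repeat=k):
        # Euclidean norm is an assumption of this sketch
        if sum(c * c for c in x) <= T * T and lo < form(*x) < hi:
            count += 1
    return count

# Hypothetical sample forms of signature (2, 1).
irrational = lambda x, y, z: x * x + y * y - math.sqrt(2) * z * z
rational = lambda x, y, z: x * x + y * y - 3 * z * z

for T in (5, 10):
    print(T, count_points(irrational, (0.0, 0.5), T), count_points(rational, (0.1, 0.9), T))
```

For the rational form the count is zero for every $T$, because its values at integer points are integers; for the irrational form the count grows with $T$.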

3. Metric linear Diophantine approximation and lattices 

The basic object to study in this section will be the set $M_{m \times n}(\mathbb{R})$ of $m \times n$
real matrices. The word "metric" refers to considering solution sets of Diophantine
inequalities in terms of the Lebesgue measure, or, when the sets are of measure
zero, for finer analysis, in terms of the Hausdorff dimension. We refer the reader to
the books [C, H, S3, Sp3] for a detailed exposition.

In what follows, $\psi(\cdot)$ will be a positive non-increasing function $\mathbb{R}_+ \to \mathbb{R}_+$. We
are going to use it to measure the precision of approximation of a real number $\alpha$
by rational numbers as follows: we would like the fractional part of $\alpha q$ to be not
bigger than $\psi(|q|)$ for infinitely many $q \in \mathbb{Z}$ in order to call $\alpha$ "sufficiently well
approximable" (this notion being dependent on $\psi$). More precisely, let us say that
$\alpha$ is $\psi$-approximable if there are infinitely many $q \in \mathbb{Z}$ such that

$|\alpha q + p| < \psi(|q|)$ for some $p \in \mathbb{Z}$.

In order to consider a matrix analogue of this notion, one needs to choose a norm
on $\mathbb{R}^k$ (we will do it by setting $\|x\| = \max_{1 \le i \le k} |x_i|$). Then one says that a
matrix $Y \in M_{m \times n}(\mathbb{R})$ (viewed as a system of $m$ linear forms in $n$ variables) is
$\psi$-approximable if there are infinitely many $q \in \mathbb{Z}^n$ such that

(3.1) $\|Yq + p\|^m < \psi(\|q\|^n)$ for some $p \in \mathbb{Z}^m$.

The above normalization (raising norms to the power equal to the dimension of
the space, instead of the traditional $\|Yq + p\| < \psi(\|q\|)$ as in [Dod] or [BD]) is
convenient for many reasons: in our opinion it makes the structure more transparent
and less dimension-dependent, and simplifies the connection with homogeneous
flows.

The whole theory starts from a positive result of Dirichlet, namely

Theorem 3.1. Every $Y \in M_{m \times n}(\mathbb{R})$ is $\psi_0$-approximable, where $\psi_0(x) = \frac{1}{x}$.
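
In the simplest case $m = n = 1$ the statement says that $|\alpha q + p| < 1/q$ has infinitely many solutions. A minimal floating-point sketch (the function name `psi0_solutions` and the choice $\alpha = \sqrt{2}$ are assumptions of this illustration) lists the solutions $q$ up to a given bound:

```python
import math

def psi0_solutions(alpha, q_max):
    """All q in 1..q_max with |alpha*q + p| < 1/q for some integer p."""
    hits = []
    for q in range(1, q_max + 1):
        dist = abs(alpha * q - round(alpha * q))  # distance from alpha*q to the nearest integer
        if dist < 1.0 / q:
            hits.append(q)
    return hits

print(psi0_solutions(math.sqrt(2), 100))
```

Among the values found are the denominators 1, 2, 5, 12, 29, 70 of the continued fraction convergents of $\sqrt{2}$.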

Clearly the faster $\psi$ decays, the smaller is the set of $\psi$-approximable matrices.
The next theorem, Groshev's [Gr] generalization of earlier results of Khintchine,
provides the zero-one law for the Lebesgue measure of this set:

Theorem 3.2 (The Khintchine-Groshev Theorem). Almost every (resp. almost
no) $Y \in M_{m \times n}(\mathbb{R})$ is $\psi$-approximable, provided the sum $\sum_{l=1}^{\infty} \psi(l)$ diverges
(resp. converges).

Now say that $Y \in M_{m \times n}(\mathbb{R})$ is badly approximable if it is not $c\psi_0$-approximable
for some $c > 0$; that is, if there exists $c > 0$ such that $\|Yq + p\|^m \|q\|^n \ge c$ for all
$p \in \mathbb{Z}^m$ and all but finitely many $q \in \mathbb{Z}^n$ (equivalently: all $q \in \mathbb{Z}^n \setminus \{0\}$).

Note that in the case $m = n = 1$, $\alpha \in \mathbb{R}$ is badly approximable if and only if
coefficients in the continued fraction expansion of $\alpha$ are bounded. Using continued
fractions, Jarnik proved in 1928 that badly approximable numbers form a set of
Hausdorff dimension one; for arbitrary $m, n$ the corresponding fact, i.e. full
Hausdorff dimension of badly approximable systems, was established by Schmidt in 1969
[S2].
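
For $m = n = 1$ the defining quantity is $q \cdot \mathrm{dist}(\alpha q, \mathbb{Z})$, and its minimum over a range of $q$ is easy to tabulate. The sketch below (hypothetical function name `min_q_dist`, floating-point arithmetic) contrasts $\sqrt{2}$, whose continued fraction coefficients are bounded, with the number 0.110001, a truncation of a Liouville-type series chosen here only as an example of a number with an unusually good rational approximation.

```python
import math

def min_q_dist(alpha, q_max):
    """Minimum over 1 <= q <= q_max of q * dist(alpha*q, Z)."""
    return min(q * abs(alpha * q - round(alpha * q)) for q in range(1, q_max + 1))

print(min_q_dist(math.sqrt(2), 5000))   # stays bounded away from zero
print(min_q_dist(0.110001, 1000))       # much smaller, because of q = 100
```

For $\sqrt{2}$ the minimum is attained at $q = 2$ and equals $6 - 4\sqrt{2} \approx 0.343$, which follows from $|p^2 - 2q^2| \ge 1$.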

The following interpretation of this property in terms of homogeneous dynamics
is due to Dani. Throughout this section we will fix $m, n \in \mathbb{N}$ and put $k = m + n$.

Theorem 3.3 [D1]. $Y \in M_{m \times n}(\mathbb{R})$ is badly approximable iff the trajectory
$\{g_t L_Y \mathbb{Z}^k \mid t \in \mathbb{R}_+\}$, with $L_Y$ as in (1.3) and $g_t$ as in (2.1), is bounded in the
space $\Omega$ of unimodular lattices in $\mathbb{R}^k$.

Instead of giving the proof (which, besides the original paper [D1], can be found
in [K2, K3]) let us point out the similarity between the above theorem and Lemma
2.7. Indeed, denote by $S$ the function on $\mathbb{R}^k$ given by

$S(x_1, \dots, x_k) = \max(|x_1|, \dots, |x_m|)^m \max(|x_{m+1}|, \dots, |x_k|)^n$.

Then $Y$ is badly approximable iff for some $c > 0$ one has

(3.2) $S(L_Y x) \ge c$ for all $x = (p, q) \in \mathbb{Z}^m \times (\mathbb{Z}^n \setminus \{0\})$.

Furthermore, the one-parameter group $\{g_t\}$ as in (2.1) is essentially (up to the
compact part) the stabilizer of $S$, and, as in the proof of Lemma 2.7, one can show
that (3.2) is equivalent to the norm of $g_t L_Y x$ being bounded away from zero for all
$x \in \mathbb{Z}^k \setminus \{0\}$ and $t \ge 0$, that is, to the statement that $\{g_t L_Y \mathbb{Z}^k \mid t \in \mathbb{R}_+\} \cap \Omega_\varepsilon = \emptyset$
for some $\varepsilon > 0$.
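
For $m = n = 1$ the correspondence can be observed numerically: the lattice $g_t L_\alpha \mathbb{Z}^2$ consists of the vectors $(e^t(\alpha q + p), e^{-t} q)$, and one can track the length of its shortest nonzero vector along the trajectory. The code below is a brute-force sketch under stated assumptions: the function name `shortest_norm` is hypothetical, the Euclidean norm is used, and the search bound relies on the fact that a unimodular planar lattice always contains a nonzero vector of length less than 1.1.

```python
import math

def shortest_norm(alpha, t):
    """Euclidean length of a shortest nonzero vector of g_t L_alpha Z^2 (case m = n = 1)."""
    best = math.exp(t)                      # the vector with q = 0, p = 1
    q_max = math.ceil(1.1 * math.exp(t))    # larger q give vectors longer than 1.1
    for q in range(1, q_max + 1):
        p = -round(alpha * q)               # the best choice of p for this q
        best = min(best, math.hypot(math.exp(t) * (alpha * q + p), math.exp(-t) * q))
    return best

for t in (0.0, 2.0, 4.0, 6.0):
    print(t, shortest_norm(math.sqrt(2), t), shortest_norm(0.110001, t))
```

For the badly approximable number $\sqrt{2}$ the shortest length stays above 0.8 for all $t \ge 0$, while for 0.110001 (an illustrative number with one very good rational approximation, with denominator 100) it drops to about 0.14 near $t = \log 1000$.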

From the above theorem and the aforementioned result of Schmidt, Dani derived

Corollary 3.4. The set

$\{\Lambda \in \Omega \mid \{g_t\Lambda \mid t \ge 0\}$ is bounded$\}$,

with $\{g_t\}$ as in (2.1), has full Hausdorff dimension.

Proof. Indeed, any $\Lambda \in \Omega$ can be written as $\begin{pmatrix} A & 0 \\ B & C \end{pmatrix} L_Y \mathbb{Z}^k$.
But as we saw in one of the examples of §2, $\{L_Y \mid Y \in M_{m \times n}(\mathbb{R})\}$ is the expanding
horospherical subgroup of $G$ relative to $g_1$; thus the conjugation of the neutral
and contracting parts plays no role and the trajectory $g_t\Lambda$ is bounded iff so is
$g_t L_Y \mathbb{Z}^k$. □

A possibility to generalize the statement of the last corollary (to actions of other
one-parameter groups on other homogeneous spaces) was mentioned by Dani in [D2]
and later conjectured by Margulis [Ma5, Conjecture (A)]. The latter conjecture was
settled by Margulis and the author in 1996. Let us state here the following weakened
version:

Theorem 3.5 [KM1, K4]. Let $G$ be a Lie group, $\Gamma$ a lattice in $G$, $F = \{g_t \mid t \ge 0\}$
be a one-parameter subsemigroup of $G$ consisting of semisimple$^1$ elements, and let
$H = H^+(g_1)$ be the expanding horospherical subgroup corresponding to $F$. Assume
in addition that the $F$-action on $G/\Gamma$ is mixing. Then for any closed $F$-invariant
null subset $Z$ of $G/\Gamma$ and any $x \in G/\Gamma$, the set

$\{h \in H \mid Fhx$ is bounded and $\overline{Fhx} \cap Z = \emptyset\}$

has full Hausdorff dimension. In particular, if $\{g_t\}$ is partially hyperbolic, then the
set $\{x \in G/\Gamma \mid Fx$ is bounded and $\overline{Fx} \cap Z = \emptyset\}$ has full Hausdorff dimension.

Note that abundance of exceptional orbits is a feature of many chaotic dynamical
systems. See e.g. [AN1, AN2, D4, Dol1, U]. In the situation of Theorem 3.5, the
construction of bounded orbits (or, more generally, orbits staying away from a fixed
part of the space) comes from uniform distribution of images of expanding leaves
(Theorem 2.3). More precisely, first one reduces the problem to the case (2.4), and
then considers natural "rectangular" partitions of $H$ (called tessellations in [KM1]
and [K1]) and studies their behavior under the automorphism $h \mapsto g_t h g_{-t}$ of $H$.
Theorem 2.3 is used to show that one can cover the set of "bad" points by a relatively
small number of rectangles. Then those rectangles are used to create a Cantor set
consisting of points with orbits avoiding $Z$ and staying within a compact subset of
$G/\Gamma$. See [KM1, K1, K4] for details and generalizations.$^2$

$^1$ $g \in G$ is called semisimple if the operator $\mathrm{Ad}_g$ is diagonalizable over $\mathbb{C}$.

So far we have illustrated the impact of ideas coming from Diophantine approx- 
imation to ergodic theory. On the other hand, Theorem 3.5 and Dani's correspon- 
dence (Theorem 3.3) can be used as an alternative proof of the aforementioned 
result of Schmidt on abundance of badly approximable systems of linear forms. 
What follows is another application to number theory, which produces a new result 
and demonstrates the power of ideas relating the two fields. 

Let us consider an inhomogeneous twist of approximation of real numbers by
rationals. Instead of just one real number $\alpha$ take a pair $(\alpha, \beta)$, consider an affine
form $x \mapsto \alpha x + \beta$ and look at fractional parts of its values at integers. Similarly, a
system of $m$ affine forms in $n$ variables will be then given by a pair $(Y, b)$, where
$Y \in M_{m \times n}(\mathbb{R})$ and $b \in \mathbb{R}^m$. Let us denote by $\tilde M_{m \times n}(\mathbb{R})$ the direct product
of $M_{m \times n}(\mathbb{R})$ and $\mathbb{R}^m$. Now say that a system of affine forms given by $(Y, b) \in
\tilde M_{m \times n}(\mathbb{R})$ is $\psi$-approximable if there are infinitely many $q \in \mathbb{Z}^n$ such that

$\|Yq + b + p\|^m < \psi(\|q\|^n)$ for some $p \in \mathbb{Z}^m$,

and badly approximable if it is not $c\psi_0$-approximable for some $c > 0$; that is, there
exists a constant $c > 0$ such that for every $p \in \mathbb{Z}^m$ and all but finitely many
$q \in \mathbb{Z}^n \setminus \{0\}$ one has

$\|Yq + b + p\|^m \|q\|^n \ge c$.

It can be proved (and follows from an inhomogeneous version of the
Khintchine-Groshev Theorem, see [C]) that the set of badly approximable $(Y, b) \in \tilde M_{m \times n}(\mathbb{R})$
is of measure zero. However, all known examples of badly approximable $(Y, b) \in
\tilde M_{m \times n}(\mathbb{R})$ belong to a countable union of proper submanifolds of $\tilde M_{m \times n}(\mathbb{R})$, hence
form a set of positive Hausdorff codimension. Yet a modification of the dynamical
approach described above works in this case as well. Namely, one considers a
collection of vectors

$\left\{ \begin{pmatrix} Yq + b + p \\ q \end{pmatrix} \,\middle|\, p \in \mathbb{Z}^m,\ q \in \mathbb{Z}^n \right\} = L_Y \mathbb{Z}^k + \begin{pmatrix} b \\ 0 \end{pmatrix}$,

which is an element of the space $\hat\Omega = \hat G/\hat\Gamma$ of affine lattices in $\mathbb{R}^k$, where

$\hat G \stackrel{\mathrm{def}}{=} \mathrm{Aff}(\mathbb{R}^k) = G \ltimes \mathbb{R}^k$ and $\hat\Gamma \stackrel{\mathrm{def}}{=} \Gamma \ltimes \mathbb{Z}^k$.

In other words,

$\hat\Omega \cong \{\Lambda + w \mid \Lambda \in \Omega,\ w \in \mathbb{R}^k\}$.

Note that the quotient topology on $\hat\Omega$ coincides with the natural topology on
the space of affine lattices: that is, $\Lambda_1 + w_1$ and $\Lambda_2 + w_2$ are close to each other
if so are $w_i$ and the generating elements of $\Lambda_i$. Note also that $\hat\Omega$ is non-compact
and has finite Haar measure, and that $\Omega$ (the set of true lattices) can be identified
with a subset of $\hat\Omega$ (affine lattices containing the zero vector). Finally, $g_t$ as in (2.1)
acts on $\hat\Omega$, and it is not hard to show that the expanding horospherical subgroup
corresponding to $g_1$ is exactly the set of all elements of $\hat G$ with linear part $L_Y$ and
translation part $\begin{pmatrix} b \\ 0 \end{pmatrix}$, $Y \in M_{m \times n}(\mathbb{R})$ and $b \in \mathbb{R}^m$.

$^2$ In particular, it follows from the methods of [KM1] that one can remove the assumption of
semisimplicity of elements of $F$, but then one needs the $F$-action to be exponentially mixing, that
is, (2.2) must hold for any $\varphi, \psi$ as in Theorem 2.2.

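Now, for $\varepsilon > 0$, define

$\hat\Omega_\varepsilon \stackrel{\mathrm{def}}{=} \{\Lambda \in \hat\Omega \mid \|x\| < \varepsilon$ for some $x \in \Lambda\}$.

Then $\hat\Omega \setminus \hat\Omega_\varepsilon$ is a closed (non-compact) set disjoint from $\Omega$.

Theorem 3.6 [K4]. Let $F = \{g_t \mid t \ge 0\}$ be as in (2.1). Then

$F\left(L_Y \mathbb{Z}^k + \begin{pmatrix} b \\ 0 \end{pmatrix}\right)$ is bounded and stays away from $\Omega$

$\Rightarrow$ $F\left(L_Y \mathbb{Z}^k + \begin{pmatrix} b \\ 0 \end{pmatrix}\right) \subset \hat\Omega \setminus \hat\Omega_\varepsilon$ for some $\varepsilon > 0$

$\Leftrightarrow$ $(Y, b)$ is badly approximable.

The proof is basically a slight modification of ideas involved in the proof of
Theorem 3.3. It follows from the results of [BM] (see also [Ma5]) that the $F$-action
on $\hat\Omega$ is mixing. Since $\Omega \subset \hat\Omega$ is closed, null and $g_t$-invariant, Theorem 3.5 applies
and one gets

Corollary 3.7. The set of badly approximable $(Y, b) \in \tilde M_{m \times n}(\mathbb{R})$ has full
Hausdorff dimension.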

See [K4] for details, remarks and extensions. 

We close the section by stating a theorem generalizing Dani's correspondence
(Theorem 3.3) to $\psi$-approximable systems. First we need a simple "change of
variables" lemma.

Lemma 3.8. Fix $m, n \in \mathbb{N}$ and $x_0 > 0$, and let $\psi : [x_0, \infty) \to (0, \infty)$ be a
non-increasing continuous function. Then there exists a unique continuous function
$\varepsilon : [t_0, \infty) \to (0, \infty)$, where $e^{k t_0} = x_0^m/\psi(x_0)^n$, such that

(3.3a) the function $t \mapsto e^{t}\varepsilon(t)^n$ is strictly increasing and unbounded,

(3.3b) the function $t \mapsto e^{-t}\varepsilon(t)^m$ is nonincreasing,

and

(3.4) $\psi(e^{t}\varepsilon(t)^n) = e^{-t}\varepsilon(t)^m$ $\forall\, t \ge t_0$.

Conversely, given $t_0 \in \mathbb{R}$ and a continuous function $\varepsilon : [t_0, \infty) \to (0, \infty)$ such that
(3.3ab) hold, there exists a unique continuous non-increasing function $\psi : [x_0, \infty) \to
(0, \infty)$, with $x_0 = e^{t_0}\varepsilon(t_0)^n$, satisfying (3.4).

See [KM3] for the proof. In many cases one can explicitly solve (3.4) to express
$\varepsilon(\cdot)$ knowing $\psi(\cdot)$ and vice versa. For example if $\psi(x) = c\psi_0(x) = c/x$, the equation
(3.4) gives $c e^{-t}\varepsilon(t)^{-n} = e^{-t}\varepsilon(t)^m$, and one sees that the corresponding function $\varepsilon$
is constant (more precisely, $\varepsilon(t) = c^{1/k}$). Or one can take $\psi(x) = c\psi_\beta(x)$, where
$\psi_\beta(x) = \frac{1}{x^{1+\beta}}$, $\beta > 0$; then $\varepsilon(t)$ decreases exponentially, namely

$\varepsilon(t) = c^{\gamma/\beta} e^{-\gamma t}$, where $\gamma = \frac{\beta}{(1+\beta)n + m}$.
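
When no closed form is available, (3.4) can be solved numerically for each fixed $t$, since its left-hand side is non-increasing and its right-hand side strictly increasing in $\varepsilon$. The following is a minimal bisection sketch; the function name `epsilon`, the bracketing interval and the iteration count are assumptions of this illustration, and $\psi$ is assumed to be defined on all positive reals.

```python
import math

def epsilon(psi, t, m, n, lo=1e-12, hi=1e12, iters=200):
    """Solve psi(e^t eps^n) = e^{-t} eps^m for eps by bisection on a logarithmic scale."""
    def excess(eps):
        # positive while the psi side of (3.4) still exceeds the right-hand side
        return psi(math.exp(t) * eps ** n) - math.exp(-t) * eps ** m
    for _ in range(iters):
        mid = math.sqrt(lo * hi)
        if excess(mid) > 0:
            lo = mid   # root lies to the right
        else:
            hi = mid   # root lies to the left
    return math.sqrt(lo * hi)

c, m, n, t = 0.5, 1, 2, 3.0
print(epsilon(lambda x: c / x, t, m, n), c ** (1 / (m + n)))
beta = 1.0
gamma = beta / ((1 + beta) * n + m)
print(epsilon(lambda x: c * x ** -(1 + beta), t, m, n), c ** (gamma / beta) * math.exp(-gamma * t))
```

In both lines the numerical solution agrees with the explicit expressions for $\varepsilon(t)$ given in the two examples.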

Now we can state a generalization of Theorem 3.3:

Theorem 3.9. $Y \in M_{m \times n}(\mathbb{R})$ is $\psi$-approximable iff there exist arbitrarily large
positive $t$ such that $g_t L_Y \mathbb{Z}^k \in \Omega_{\varepsilon(t)}$, where $\{g_t\}$ is as in (2.1), $L_Y$ as in (1.3), and
$\varepsilon(\cdot)$ is the function corresponding to $\psi$ as in the previous lemma.

Loosely speaking, good rational approximations for $Y$ correspond to far
excursions of the orbit into the "cusp neighborhoods" $\Omega_\varepsilon$. In other words, one can
measure the "growth rate" of the orbit in terms of hitting the sets $\Omega_{\varepsilon(t)}$ in time
$t$ for infinitely many $t \in \mathbb{N}$, and fast-growing orbits would correspond to systems
approximable with a fast-decaying approximation function.

It is shown in [KM3] how the above correspondence provides an alternative
(dynamical) proof of Theorem 3.2. More precisely, one can use ergodic properties
of the $g_t$-action on $\Omega$ (exponential decay of correlations, see Theorem 2.2) to prove
the following

Theorem 3.10. Let $\varepsilon(\cdot)$ be any positive function. Then for almost all (resp. almost
no) $\Lambda \in \Omega$ one has $g_t\Lambda \in \Omega_{\varepsilon(t)}$ for infinitely many $t \in \mathbb{N}$, provided the sum

(3.5) $\sum_{t=1}^{\infty} \varepsilon(t)^k$

diverges (resp. converges).

We remark that the ratio $\mu(\Omega_\varepsilon)/\varepsilon^k$ is shown in [KM3] to be bounded from
both sides; therefore the sum (3.5) is finite/infinite iff so is $\sum_{t=1}^{\infty} \mu(\Omega_{\varepsilon(t)})$. This
places the above theorem in the rank of Borel-Cantelli type results. See [KM3] for
generalizations and applications, and [CK, CR, Dol2, Ph, Su] for other results of
similar flavor.

Another application of the correspondence of Theorem 3.9 will be given in the 
next section. 



4. Diophantine approximation on manifolds

We start from the setting of the previous section but specialize to the case
$m = 1$; that is, to Diophantine approximation of just one linear form given by
$y \in \mathbb{R}^n$. Recall that Theorem 3.2 says that whenever $\sum_{l=1}^{\infty} \psi(l)$ is finite, almost
every $y$ is not $\psi$-approximable; that is, the inequality

(4.1) $|q \cdot y + p| < \psi(\|q\|^n)$

has at most finitely many solutions. It is instructive to sketch an elementary proof:
for fixed $p, q$, the set of $y$ satisfying (4.1) is a $\frac{\psi(\|q\|^n)}{\|q\|}$-neighborhood of a hyperplane

(4.2) $q \cdot y + p = 0$;

thus if one restricts $y$ to lie in $[0,1]^n$ (or any other bounded subset of $\mathbb{R}^n$), the
set of solutions will have measure at most $\mathrm{const} \cdot \frac{\psi(\|q\|^n)}{\|q\|}$. Since there are at most
$\mathrm{const} \cdot \|q\|$ admissible values of $p$, the sum of measures of all sets of solutions is at
most

$\mathrm{const} \cdot \sum_{q \in \mathbb{Z}^n} \psi(\|q\|^n) \asymp \sum_{l=1}^{\infty} l^{n-1}\psi(l^n) \asymp \sum_{l=1}^{\infty} \psi(l)$,

and the proof is finished by an application of the Borel-Cantelli Lemma.

Recall that $\psi_\beta(x) = x^{-(1+\beta)}$, $\beta > 0$, was one of the examples of functions
realizing the convergence case in the Khintchine-Groshev Theorem. Say that $\mathbf{y} \in \mathbb{R}^n$
is very well approximable (VWA) if it is $\psi_\beta$-approximable for some $\beta > 0$. Thus
almost all $\mathbf{y} \in \mathbb{R}^n$ are not VWA.

Now consider the following problem, raised by Mahler in 1932 [M]: is it true that
for almost all $x \in \mathbb{R}$ the inequality

$$|p + q_1 x + q_2 x^2 + \dots + q_n x^n| < \|\mathbf{q}\|^{-n(1+\beta)}$$

has at most finitely many solutions? In other words, for a.e. $x$ the $n$-tuple

$$\mathbf{y}(x) = (x, x^2, \dots, x^n) \tag{4.3}$$

is not VWA. The proof presented above does not work, since this time one has to 
estimate the measure of intersection of the curve (4.3) with the sets of solutions 
of inequalities (4.1), and for some choices of p, q (namely for those which make 
the hyperplane (4.2) nearly tangent to the curve (4.3)) it is hard to produce a 
reasonable estimate. 

This problem stood open for more than 30 years until it was solved in 1964 
by Sprindzuk [Sp1, Sp2]. Earlier several special cases were considered, and, quoting
Sprindzuk's survey paper, the problem rapidly revealed itself to be non-trivial
and involving "deep and complicated phenomena in which arithmetical properties 
of numbers are closely entangled with combinatorial-topological properties" of the 
curve. The solution to Mahler's problem has eventually led to the development of 
a new branch of metric number theory, usually referred to as "Diophantine approximation
with dependent quantities" or "Diophantine approximation on manifolds".
We invite the reader to look at Sprindzuk's monographs [Sp2, Sp3] and a recent 
book [BD] for a systematic exposition of the field. 

Mahler's problem and its generalizations have several motivations. The original 
motivation of Mahler comes from transcendental number theory. Indeed, the $n$-tuple
(4.3) is not VWA if and only if for every $\beta > 0$ there are at most finitely
many polynomials $P \in \mathbb{Z}[x]$ with degree at most $n$ such that $|P(x)| < h(P)^{-n(1+\beta)}$,
where $h(P)$ is the height of $P$; loosely speaking, $x$ is "not very algebraic", and the
affirmative solution to the problem shows that almost all $x$ are such.

Another motivation comes from KAM theory: it is known that the behavior of
perturbations of solutions of ODEs is related to Diophantine properties of coefficients.
If the latter are restricted to lie on a curve or submanifold of $\mathbb{R}^n$, it may be
important to know that almost all values have certain approximation properties. See [de
la Llave's lectures, this volume] and [BD, Chapter 7].

However, from the author's personal viewpoint, the appeal of this branch of 
number theory lies in its existing and potential generalizations. In a sense, the 
affirmative solution to Mahler's problem shows that a certain property of $\mathbf{y} \in \mathbb{R}^n$
(being not VWA) which holds for generic $\mathbf{y} \in \mathbb{R}^n$ in fact holds for generic points on
the curve (4.3). In other words, the curve inherits the above Diophantine property
from the ambient space, unlike, for example, the line $\mathbf{y}(x) = (x, \dots, x)$: it is clear
that every point on this line is VWA. This gives rise to studying other subsets
of $\mathbb{R}^n$ and other Diophantine properties, and looking at whether this inheritance
phenomenon takes place. 

Note that the curve (4.3) is not contained in any proper affine subspace of $\mathbb{R}^n$ (in other
words, constitutes an essentially $n$-dimensional object). The latter property, or,
more precisely, its infinitesimal analogue, is formalized in the following way. Let
$V$ be an open subset of $\mathbb{R}^d$. Say that an $n$-tuple $\mathbf{f} = (f_1, \dots, f_n)$ of $C^l$ functions
$V \to \mathbb{R}$ is nondegenerate at $\mathbf{x} \in V$ if the space $\mathbb{R}^n$ is spanned by partial derivatives
of $\mathbf{f}$ at $\mathbf{x}$ of order up to $l$. If $M \subset \mathbb{R}^n$ is a $d$-dimensional smooth submanifold, we will
say that $M$ is nondegenerate at $\mathbf{y} \in M$ if any (equivalently, some) diffeomorphism
$\mathbf{f}$ between an open subset $V$ of $\mathbb{R}^d$ and a neighborhood of $\mathbf{y}$ in $M$ is nondegenerate
at $\mathbf{f}^{-1}(\mathbf{y})$. We will say that $\mathbf{f} : V \to \mathbb{R}^n$ (resp. $M \subset \mathbb{R}^n$) is nondegenerate if it
is nondegenerate at almost every point of $V$ (resp. $M$, in the sense of the natural
measure class on $M$). If the functions $f_i$ are analytic, it is easy to see that the linear
independence of $1, f_1, \dots, f_n$ over $\mathbb{R}$ in $V$ is equivalent to all points of $M = \mathbf{f}(V)$
being nondegenerate. Thus the above nondegeneracy condition can be viewed as
an infinitesimal version of not lying in any proper affine subspace of $\mathbb{R}^n$.
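
As an illustration of the definition, one can verify directly that the Mahler curve (4.3) is nondegenerate at every point, while the line $(x, \dots, x)$ is not. The sketch below is ours and only illustrative: the function names are hypothetical, and it uses the elementary formula $\frac{d^i}{dx^i} x^j = \frac{j!}{(j-i)!} x^{j-i}$. The matrix of derivatives of orders $1, \dots, n$ of the curve is triangular with diagonal entries $j!$, so its determinant is $\prod_{j=1}^n j!$ regardless of $x$.

```python
from fractions import Fraction
from math import factorial


def mahler_derivative_matrix(n, x):
    """Row i (1 <= i <= n) holds the i-th derivative of (x, x^2, ..., x^n) at x."""
    x = Fraction(x)
    rows = []
    for i in range(1, n + 1):
        row = []
        for j in range(1, n + 1):
            if i > j:
                row.append(Fraction(0))
            else:
                # d^i/dx^i x^j = j!/(j-i)! * x^(j-i)
                row.append(Fraction(factorial(j), factorial(j - i)) * x ** (j - i))
        rows.append(row)
    return rows


def line_derivative_matrix(n):
    """Same matrix for the line (x, ..., x): only the first derivative is nonzero."""
    return [[Fraction(1)] * n] + [[Fraction(0)] * n for _ in range(n - 1)]


def determinant(matrix):
    """Exact determinant by Gaussian elimination over the rationals."""
    m = [row[:] for row in matrix]
    size = len(m)
    det = Fraction(1)
    for col in range(size):
        pivot = next((r for r in range(col, size) if m[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            det = -det
        det *= m[col][col]
        for r in range(col + 1, size):
            factor = m[r][col] / m[col][col]
            for c in range(col, size):
                m[r][c] -= factor * m[col][c]
    return det
```

A nonzero determinant means that the derivatives of order up to $n$ span $\mathbb{R}^n$; for the line the determinant vanishes as soon as $n \ge 2$, reflecting the fact that it lies in a proper subspace.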

It appears that many known and anticipated results in the field fall in the frame- 
work of the following vague 

Meta-Conjecture. "Any" Diophantine property of vectors in an ambient space
(e.g. $\mathbb{R}^n$) which holds for almost all points in this space should hold for generic
points on a nondegenerate smooth submanifold $M$ of the space.

It was conjectured in 1980 by Sprindzuk [Sp4, Conjecture $H_1$] that almost all
points on a nondegenerate analytic submanifold of $\mathbb{R}^n$ are not VWA. This conjecture
was supported before and after 1980 by a number of partial results, one of the first
being Schmidt's proof [S1] for nondegenerate planar curves. The general case was
settled in 1998 by Margulis and the author using the dynamical approach. Namely, 
the following was proved: 

Theorem 4.1 [KM2]. Let $M$ be a nondegenerate smooth submanifold of $\mathbb{R}^n$. Then
almost all points of $M$ are not VWA.

This is the result we will focus on later in this section. 

In another direction, Sprindzuk's solution to Mahler's problem was improved 
in 1964 by Baker [B1] and later (1984) by Bernik [Bern, BD]; the latter proved
that whenever $\sum_{l=1}^\infty \psi(l)$ is finite, almost all points of the curve (4.3) are not
$\psi$-approximable. And several years ago Beresnevich [Bere1] proved the divergence
counterpart, thus establishing a complete analogue of the Khintchine-Groshev Theorem
for the curve (4.3).

It turned out that a modification of the methods from [KM2] allows one to prove
the convergence part of the Khintchine-Groshev Theorem for any nondegenerate
manifold. In other words, the following is true:

Theorem 4.2. Let $M$ be a nondegenerate smooth submanifold of $\mathbb{R}^n$ and let $\psi$ be
such that $\sum_{l=1}^\infty \psi(l)$ is finite. Then almost all points of $M$ are not $\psi$-approximable.

This is proved in [BKM] and also independently in [Bere2]. Work on the
divergence case is currently in progress. (We note that the main result of [BKM]
has a stronger "multiplicative" version (see Theorem 5.3) which is currently out of
reach of the classical Sprindzuk-style methods developed in [Bere2].)

In this survey we will indicate a proof of Theorem 4.1 by first restating it in the 
language of flows on the space of lattices. For this we set k = n + 1 and look at the 
one-parameter group 

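$$g_t = \mathrm{diag}(e^t, e^{-t/n}, \dots, e^{-t/n}) \tag{4.4}$$

acting on $\Omega = SL_k(\mathbb{R})/SL_k(\mathbb{Z})$, and given $\mathbf{y} \in \mathbb{R}^n$, consider

$$L_{\mathbf{y}} \stackrel{\mathrm{def}}{=} \begin{pmatrix} 1 & \mathbf{y}^T \\ 0 & I_n \end{pmatrix}$$

(cf. (2.1) and (1.3)). It follows from Theorem 3.9 and the example discussed afterwards
that $\mathbf{y} \in \mathbb{R}^n$ is VWA iff for some $\gamma > 0$ there exist arbitrarily large positive
$t$ such that

$$g_t L_{\mathbf{y}} \mathbb{Z}^k \in \Omega_{e^{-\gamma t}}. \tag{4.5}$$

Equivalently, for some $\gamma > 0$ there are infinitely many $t \in \mathbb{N}$ such that (4.5) holds.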
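
To see concretely how good approximations produce short lattice vectors, here is a small numerical sketch. It rests on an assumption: we take $L_{\mathbf{y}}$ to be the unipotent matrix with first row $(1, \mathbf{y})$ and the identity elsewhere, so that $g_t L_{\mathbf{y}} (p, \mathbf{q})^T = (e^t(p + \mathbf{q} \cdot \mathbf{y}),\, e^{-t/n}\mathbf{q})$. The function names are hypothetical and chosen for illustration only.

```python
import math


def apply_gt_Ly(y, p, q, t):
    """Return g_t L_y (p, q)^T for g_t = diag(e^t, e^{-t/n}, ..., e^{-t/n}).

    Assumes L_y (p, q)^T = (p + q.y, q), i.e. L_y is upper unipotent.
    """
    n = len(y)
    first = math.exp(t) * (p + sum(qi * yi for qi, yi in zip(q, y)))
    rest = [math.exp(-t / n) * qi for qi in q]
    return [first] + rest


def sup_norm(v):
    """Supremum norm of a vector."""
    return max(abs(c) for c in v)


if __name__ == "__main__":
    # n = 1, y = sqrt(2): the convergent 17/12 gives a small value of |12 y - 17|.
    y, p, q = [math.sqrt(2)], -17, [12]
    err = abs(p + q[0] * y[0])
    t = 0.5 * math.log(q[0] / err)  # time at which both coordinates balance
    print(t, sup_norm(apply_gt_Ly(y, p, q, t)))
```

At the balancing time $t$ both coordinates have absolute value $\sqrt{q\,|qy + p|}$, so the smaller the approximation error relative to $1/q$, the deeper the lattice $g_t L_{\mathbf{y}}\mathbb{Z}^k$ goes into the cusp.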

With this in mind, let us turn to the setting of Theorem 4.1. Namely let $V$ be
an open subset of $\mathbb{R}^d$ and $\mathbf{f} = (f_1, \dots, f_n)$ an $n$-tuple of $C^k$ functions $V \to \mathbb{R}$ which
is nondegenerate at almost every point of $V$. The theorem would be proved if we
show that for any $\gamma > 0$ the set

$$\{\mathbf{x} \in V \mid g_t L_{\mathbf{f}(\mathbf{x})} \mathbb{Z}^k \in \Omega_{e^{-\gamma t}} \text{ for infinitely many } t \in \mathbb{N}\}$$

has measure zero. In other words, a submanifold $\mathbf{f}(V)$ of $\mathbb{R}^n$ gives rise to a submanifold
$L_{\mathbf{f}(V)}\mathbb{Z}^k$ of the space of lattices, and one needs to show that the growth rate of
generic orbits originating from this submanifold is consistent with the growth rate
of an orbit of a generic point of $\Omega$ (see Theorem 3.10 for an explanation of why
lattices $\Lambda$ such that $g_t\Lambda \in \Omega_{e^{-\gamma t}}$ for infinitely many $t \in \mathbb{N}$ form a null subset of $\Omega$).

Now one can use the Borel-Cantelli Lemma to reduce Theorem 4.1 to the fol- 
lowing statement: 

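Theorem 4.3. Let $V$ be an open subset of $\mathbb{R}^d$ and $\mathbf{f} = (f_1, \dots, f_n)$ an $n$-tuple
of $C^k$ functions $V \to \mathbb{R}$ which is nondegenerate at $\mathbf{x}_0 \in V$. Then there exists a
neighborhood $B$ of $\mathbf{x}_0$ contained in $V$ such that for any $\gamma > 0$ one has

$$\sum_{t=1}^\infty \bigl|\{\mathbf{x} \in B \mid g_t L_{\mathbf{f}(\mathbf{x})} \mathbb{Z}^k \in \Omega_{e^{-\gamma t}}\}\bigr| < \infty. \tag{4.6}$$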

Here is the turning point of the argument: $t$-dynamics gives way to $\mathbf{x}$-dynamics.
Namely, a natural way to demonstrate (4.6) is to fix $t$ and think of the set $\{g_t L_{\mathbf{f}(\mathbf{x})} \mathbb{Z}^k\}$
as an orbit of a certain action (not a group action!), the goal being to prove that
a substantial part of this "orbit" lies outside of the "cusp neighborhoods" $\Omega_{e^{-\gamma t}}$
uniformly for all $t$. What immediately comes to mind is the recurrence property of
unipotent orbits, that is, Theorem 2.4. And it turns out that a modification of the
argument used to prove the latter theorem allows one to estimate the amount of
"time" $\mathbf{x}$ that the "trajectory" $\mathbf{x} \mapsto g_t L_{\mathbf{f}(\mathbf{x})} \mathbb{Z}^k$ spends "close to infinity" in $\Omega$. More
precisely, the following can be proved:

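Theorem 4.4. Let $V$, $\mathbf{f}$ and $\mathbf{x}_0$ be as in Theorem 4.3. Then there exist a neighborhood
$B$ of $\mathbf{x}_0$ contained in $V$ and constants $C, \rho > 0$ such that for any $\varepsilon > 0$ and
any positive $t$ one has

$$\bigl|\{\mathbf{x} \in B \mid g_t L_{\mathbf{f}(\mathbf{x})} \mathbb{Z}^k \in \Omega_\varepsilon\}\bigr| \le C (\varepsilon/\rho)^{1/dn} |B|. \tag{4.7}$$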

It is straightforward to verify that Theorem 4.3 follows from the above uniform 
estimate. 
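
The verification amounts to summing a geometric series. The sketch below is only illustrative and assumes that the bound in (4.7) has the form $C(\varepsilon/\rho)^{1/dn}|B|$; substituting $\varepsilon = e^{-\gamma t}$ gives terms with common ratio $r = e^{-\gamma/dn} < 1$. The function names and the sample constants are hypothetical.

```python
import math


def tail_sum_bound(C, rho, gamma, d, n, vol_B):
    """Closed form of sum_{t >= 1} C * (exp(-gamma * t) / rho)**(1/(d*n)) * vol_B."""
    alpha = 1.0 / (d * n)
    r = math.exp(-gamma * alpha)  # common ratio, strictly less than 1 for gamma > 0
    return C * rho ** (-alpha) * vol_B * r / (1.0 - r)


def partial_sum(C, rho, gamma, d, n, vol_B, T):
    """Partial sum of the same series over t = 1, ..., T."""
    alpha = 1.0 / (d * n)
    return sum(
        C * (math.exp(-gamma * t) / rho) ** alpha * vol_B for t in range(1, T + 1)
    )
```

Since the closed form is finite for every $\gamma > 0$, the series in (4.6) converges, and the Borel-Cantelli Lemma applies.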

It remains to explain why the behavior of the curve $\mathbf{x} \mapsto g_t L_{\mathbf{f}(\mathbf{x})}\mathbb{Z}^k$ is similar to
that of the unipotent orbit. In fact it has been understood for a long time that
the main property of the unipotent actions on which the recurrence estimates are
based is the polynomial dependence of $u_x$ on $x$ (see [Ma2, Sh]). One may wonder
what is so special about polynomials; it turns out that the crucial property
is roughly "not making very sharp turns". More precisely, here is the definition
motivated by the analysis of the proofs in [Ma2] (see also [EMS]): for $C, \alpha > 0$ say
that a continuous function $f$ on an open set $V \subset \mathbb{R}^d$ is $(C, \alpha)$-good on $V$ if for any
open ball $B \subset V$ and any positive $\varepsilon$ one has

$$\Bigl|\bigl\{\mathbf{x} \in B \bigm| |f(\mathbf{x})| < \varepsilon \cdot \sup_{\mathbf{x} \in B} |f(\mathbf{x})|\bigr\}\Bigr| \le C \varepsilon^\alpha |B|.$$

In other words, a good function which takes small values on a big part of a ball is 
not allowed to grow very fast on the remaining part of the ball. The main example 
is provided by polynomials: 

Lemma 4.5. Any polynomial $f \in \mathbb{R}[x]$ of degree not greater than $k$ is $(4k, 1/k)$-good
on $\mathbb{R}$.
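
The inequality of Lemma 4.5 is easy to test numerically. The following sketch is an illustration rather than a proof: it estimates the measure of the set where a polynomial is small by sampling a uniform grid on an interval, so both the supremum and the measure are approximations, and the helper names are hypothetical.

```python
def small_value_fraction(coeffs, a, b, eps, samples=100_000):
    """Grid estimate of |{x in (a, b) : |f(x)| < eps * sup|f|}| / (b - a).

    coeffs lists the coefficients of f from the highest degree down.
    """
    def f(x):
        value = 0.0
        for c in coeffs:  # Horner's rule
            value = value * x + c
        return value

    xs = [a + (b - a) * (i + 0.5) / samples for i in range(samples)]
    values = [abs(f(x)) for x in xs]
    sup = max(values)
    return sum(1 for v in values if v < eps * sup) / samples


def good_bound(k, eps):
    """Right-hand side of the (4k, 1/k)-good inequality, divided by |B|."""
    return 4 * k * eps ** (1.0 / k)


if __name__ == "__main__":
    # f(x) = x^3 on (-1, 1): the exact fraction is eps^(1/3).
    print(small_value_fraction([1, 0, 0, 0], -1.0, 1.0, 1e-3), good_bound(3, 1e-3))
```

For $f(x) = x^k$ on $(-1, 1)$ the fraction equals $\varepsilon^{1/k}$ exactly, which shows that the exponent $1/k$ in the lemma cannot be improved.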

Lemma 4.5 follows easily from Lagrange's interpolation formula, see [DM2, KM2]. The
next theorem (the main result of [KM2]) therefore provides a generalization of
Theorem 2.4 to polynomial trajectories on $\Omega$. To state it we need to introduce some
notation. If $\Delta$ is a discrete subgroup of $\mathbb{R}^k$ (not necessarily a lattice) generated
by $\mathbf{v}_1, \dots, \mathbf{v}_j$, let us measure its norm, $\|\Delta\|$, by the norm of the exterior product
$\mathbf{v}_1 \wedge \dots \wedge \mathbf{v}_j$. For this one needs to extend the norm from $\mathbb{R}^k$ to its exterior algebra.
If $\mathbf{e}_1, \dots, \mathbf{e}_k$ are standard base vectors of $\mathbb{R}^k$, the elements $\mathbf{e}_I \stackrel{\mathrm{def}}{=} \mathbf{e}_{i_1} \wedge \dots \wedge \mathbf{e}_{i_l}$,
$I = \{i_1, \dots, i_l\} \subset \{1, \dots, k\}$, form a basis of $\bigwedge^l(\mathbb{R}^k)$. Since Diophantine applications
call for the supremum norm, we will extend $\|\cdot\|$ to $\bigwedge(\mathbb{R}^k)$ by setting
$\|\sum_I w_I \mathbf{e}_I\| = \max_I |w_I|$.

Now let us consider a curve in $\Omega$ given by $\mathbf{x} \mapsto h(\mathbf{x})\mathbb{Z}^k$, where $h$ is some function
from $\mathbb{R}^d$ to $GL_k(\mathbb{R})$. It turns out that in order to understand its recurrence
properties one has to keep an eye on norms of all discrete subgroups of $h(\mathbf{x})\mathbb{Z}^k$;
in particular, it will be necessary to prove that all those norms (as functions of $\mathbf{x}$)
are $(C, \alpha)$-good for some $C, \alpha$. In fact, it will suffice to look at the coordinates of
$h(\mathbf{x})(\mathbf{v}_1 \wedge \dots \wedge \mathbf{v}_j)$ where $\mathbf{v}_1, \dots, \mathbf{v}_j$ form a basis of $\Delta \subset \mathbb{Z}^k$; one can easily show that
if all components of a vector function are $(C, \alpha)$-good, the norm of this function is
also $(C, \alpha)$-good.
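
In coordinates, the components of $\mathbf{v}_1 \wedge \dots \wedge \mathbf{v}_j$ with respect to the basis $\{\mathbf{e}_I\}$ are the $j \times j$ minors of the matrix with rows $\mathbf{v}_1, \dots, \mathbf{v}_j$, so the supremum norm of a subgroup is the largest absolute value of such a minor. The sketch below computes this quantity for small examples; the function names are ours, and the determinant is computed by cofactor expansion, which is adequate only for small $j$.

```python
from itertools import combinations


def det(rows):
    """Determinant of a small square matrix by cofactor expansion along the first row."""
    if len(rows) == 1:
        return rows[0][0]
    total = 0
    for col, entry in enumerate(rows[0]):
        minor = [r[:col] + r[col + 1:] for r in rows[1:]]
        total += (-1) ** col * entry * det(minor)
    return total


def wedge_sup_norm(vectors):
    """Sup norm of v_1 ^ ... ^ v_j: the largest |minor| over all choices of j columns."""
    j, k = len(vectors), len(vectors[0])
    return max(
        abs(det([[v[c] for c in cols] for v in vectors]))
        for cols in combinations(range(k), j)
    )


if __name__ == "__main__":
    print(wedge_sup_norm([[1, 2, 3], [4, 5, 6]]))
```

For a single vector the norm reduces to the usual supremum norm, and for $j = k$ it is the absolute value of the determinant, that is, the covolume of the lattice.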

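Theorem 4.6. Let $d, k \in \mathbb{N}$, $C, \alpha > 0$, $0 < \rho \le 1/k$, and let a ball $B = B(\mathbf{x}_0, r_0) \subset
\mathbb{R}^d$ and a map $h : \tilde{B} \to GL_k(\mathbb{R})$ be given, where $\tilde{B}$ stands for $B(\mathbf{x}_0, 3^k r_0)$. Assume
that for any subgroup $\Delta$ of $\mathbb{Z}^k$,

(i) the function $\mathbf{x} \mapsto \|h(\mathbf{x})\Delta\|$ is $(C, \alpha)$-good on $\tilde{B}$;

(ii) $\sup_{\mathbf{x} \in B} \|h(\mathbf{x})\Delta\| \ge \rho$.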

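Then for any positive $\varepsilon < \rho$ one has

$$\bigl|\{\mathbf{x} \in B \mid h(\mathbf{x})\mathbb{Z}^k \in \Omega_\varepsilon\}\bigr| \le \mathrm{const}(d, k) \cdot C\, (\varepsilon/\rho)^\alpha |B|. \tag{4.8}$$
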
Corollary 4.7. For any lattice $\Lambda$ in $\mathbb{R}^k$ there exists a constant $\rho = \rho(\Lambda) > 0$ such
that for any one-parameter unipotent subgroup $\{u_x\}_{x \in \mathbb{R}}$ of $SL_k(\mathbb{R})$, for any $T > 0$
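and any $\varepsilon < \rho$, one has

$$\bigl|\{0 < x < T \mid u_x\Lambda \in \Omega_\varepsilon\}\bigr| \le \mathrm{const}(k)\, (\varepsilon/\rho)^{1/k^2}\, T. \tag{4.9}$$
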
This is clearly a quantitative strengthening of Theorem 2.4, with an explicit
estimate of $\delta$ in terms of $\varepsilon$.

Proof. Write $\Lambda$ in the form $g\mathbb{Z}^k$ with $g \in GL_k(\mathbb{R})$, and denote by $h$ the function
$h(x) = u_x g$. For any $\Delta \subset \mathbb{Z}^k$ with basis $\mathbf{v}_1, \dots, \mathbf{v}_j$, the coordinates of
$h(x)(\mathbf{v}_1 \wedge \dots \wedge \mathbf{v}_j)$ will be polynomials in $x$ of degree not exceeding $k^2$. Hence the functions
$x \mapsto \|h(x)\Delta\|$ will be $(C, 1/k^2)$-good on $\mathbb{R}$, where $C$ is a constant depending only
on $k$. Now let $\rho \stackrel{\mathrm{def}}{=} \min\bigl(1/k, \inf_{\Delta \in \mathcal{L}(\mathbb{Z}^k)} \|g\Delta\|\bigr)$, positive by the discreteness of $\Lambda$
in $\mathbb{R}^k$. Then $\|h(0)\Delta\| \ge \rho$ for any $\Delta \subset \mathbb{Z}^k$; therefore, with the substitutions
$B = (0, T)$, $\alpha = 1/k^2$ and $d = 1$, assumptions (i) and (ii) of Theorem 4.6 are
satisfied, and one immediately gets (4.9) from (4.8). □

As was mentioned in §2, the proof of Theorem 4.6 is based on delicate com- 
binatorial (partially ordered) structure of the space of lattices, and the reader is 
referred to [KM2] or [BKM] (most of the ideas are borrowed from [Ma2] and [D3]). 
Assuming the latter theorem, we conclude by presenting a

Sketch of proof of Theorem 4.4. Take a positive $t$ and consider $h(\mathbf{x}) \stackrel{\mathrm{def}}{=} g_t L_{\mathbf{f}(\mathbf{x})}$;
clearly all one needs to prove (4.7) is to check conditions (i) and (ii) of Theorem
4.6 for every $\Delta \subset \mathbb{Z}^k$. An elementary computation shows that the coordinates of
$h(\mathbf{x})(\mathbf{v}_1 \wedge \dots \wedge \mathbf{v}_j)$ for any choice of vectors are linear combinations of functions
$f_1, \dots, f_n$ and $1$. Consider first the case of original Mahler's conjecture, with $d = 1$
and $f_i(x) = x^i$. Then, as in the proof of Corollary 4.7, condition (i) is automatic
due to Lemma 4.5. Further, a straightforward computation of the action of $h(x)$
on exterior products of vectors in $\mathbb{R}^k$ shows that at least one coefficient of at least
one polynomial arising as a coordinate must have absolute value not less than 1.
This implies that for every interval $B$ there exists a constant $\rho$ (independent of $t$)
such that (ii) holds.

It remains to pass from this special case to the general situation of functions 
$f_1, \dots, f_n$ on $\mathbb{R}^d$ coordinatizing a nondegenerate submanifold of $\mathbb{R}^n$. Here one
basically has to show that locally these functions behave like polynomials. Indeed, 
the following was proved in [KM2]: 

Lemma 4.8. Let $\mathbf{f} = (f_1, \dots, f_n)$ be a $C^l$ map from an open subset $V$ of $\mathbb{R}^d$ to
$\mathbb{R}^n$, and let $\mathbf{x}_0 \in V$ be such that $\mathbb{R}^n$ is spanned by partial derivatives of $\mathbf{f}$ at $\mathbf{x}_0$ of
order up to $l$. Then there exists a neighborhood $U \subset V$ of $\mathbf{x}_0$ and positive $C$ such
that any linear combination of $1, f_1, \dots, f_n$ is $(C, 1/dl)$-good on $U$.

Now to finish the proof one simply has to choose $U$ according to the above lemma,
then pick a ball $\tilde{B}$ centered at $\mathbf{x}_0$ and contained in $U$, and finally take $B$ to be a
concentric ball with radius $3^k$ times smaller. This implies condition (i) with $\alpha = 1/dn$
and some constant $C$ independent of $\Delta$ and $t$, and (ii) follows as a result of a
computation described above: one shows that at least one coordinate of $h(\mathbf{x})(\mathbf{v}_1 \wedge \dots \wedge \mathbf{v}_j)$
must have the form $c_0 + \sum_{i=1}^n c_i f_i(\mathbf{x})$ with $\max(|c_1|, \dots, |c_n|) \ge 1$, and therefore
one gets a lower bound (again independent of $t$ and $\Delta$) for $\sup_{\mathbf{x} \in B} \|h(\mathbf{x})\Delta\|$. □

5. Multiplicative approximation 

We have already seen in Conjecture 1.2 how the magnitude of the integer vector
$\mathbf{q}$ was measured by taking the product of coordinates rather than the maximal
coordinate (that is, the norm of the vector). Let us formalize it by saying, for $\psi$ as
before, that $Y \in M_{m \times n}(\mathbb{R})$ is $\psi$-multiplicatively approximable ($\psi$-MA) if there are
infinitely many $\mathbf{q} \in \mathbb{Z}^n$ such that

$$\Pi(Y\mathbf{q} + \mathbf{p}) < \psi\bigl(\Pi_+(\mathbf{q})\bigr) \quad \text{for some } \mathbf{p} \in \mathbb{Z}^m,$$

where for $\mathbf{x} = (x_1, \dots, x_k) \in \mathbb{R}^k$ one defines $\Pi(\mathbf{x}) = \prod_{i=1}^k |x_i|$ and
$\Pi_+(\mathbf{x}) = \prod_{i=1}^k \max(|x_i|, 1)$. Clearly any $\psi$-approximable system of linear forms is
automatically $\psi$-MA, but the converse is not necessarily true. Similarly to the standard
setting, one can define badly multiplicatively approximable (BMA) and very well
multiplicatively approximable (VWMA) systems. It can be easily shown that almost
no $Y \in M_{m \times n}(\mathbb{R})$ are $\psi$-MA if the sum

$$\sum_{l=1}^\infty (\log l)^{k-2}\, \psi(l)$$

converges (here we again set $k = m + n$); in particular, VWMA systems form a set
of measure zero. The converse (i.e. a multiplicative analogue of Theorem 3.2) can
be proved using methods of Schmidt; the case $n = 1$ is contained in [G].
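
The claim that $\psi$-approximable systems are automatically $\psi$-MA rests on two elementary inequalities, $\Pi(\mathbf{x}) \le \|\mathbf{x}\|^m$ for $\mathbf{x} \in \mathbb{R}^m$ and $\Pi_+(\mathbf{q}) \le \|\mathbf{q}\|^n$ for nonzero $\mathbf{q} \in \mathbb{Z}^n$, combined with the assumption (ours, for this illustration) that $\psi$ is non-increasing. The following sketch implements the two products with hypothetical function names and checks the inequalities on examples.

```python
from math import prod


def Pi(x):
    """Product of the absolute values of the coordinates."""
    return prod(abs(c) for c in x)


def Pi_plus(x):
    """Product of max(|x_i|, 1) over the coordinates."""
    return prod(max(abs(c), 1) for c in x)


def sup_norm(x):
    """Supremum norm of a vector."""
    return max(abs(c) for c in x)


if __name__ == "__main__":
    q = (5, 0, -2)
    # Pi vanishes when a coordinate is zero, while Pi_plus ignores small coordinates.
    print(Pi(q), Pi_plus(q), sup_norm(q) ** len(q))
```

The gap between $\Pi_+(\mathbf{q})$ and $\|\mathbf{q}\|^n$ can be large when the coordinates of $\mathbf{q}$ have very different sizes, which is why the multiplicative condition is easier to satisfy.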

On the other hand, saying that a vector $\mathbf{y} \in \mathbb{R}^n$ (viewed as a linear form $\mathbf{q} \mapsto \mathbf{y} \cdot \mathbf{q}$)
is not BMA is equivalent to (1.1); in other words, Conjecture 1.2 states that no
$\mathbf{y} \in \mathbb{R}^n$, $n \ge 2$, is badly multiplicatively approximable. A more general statement
that no $Y \in M_{m \times n}(\mathbb{R})$ is BMA unless $m = n = 1$ in fact reduces to this conjecture;
moreover, as is the case with Theorem 1.1, it is enough to prove Conjecture 1.2
for $n = 2$.

It seems natural to bring lattices into the game. In fact, one can observe the
similarity between the statements of the Oppenheim and Littlewood conjectures.
Indeed, to say that $Y \in M_{m \times n}(\mathbb{R})$ is not BMA amounts to saying that $0$ is the
infimum of absolute values of a certain homogeneous polynomial at integer points.
Similarly to what was done for quadratic forms, a linear change of variables transforms
this polynomial into the product of coordinates $\Pi(\mathbf{x})$, $\mathbf{x} \in \mathbb{R}^k$, and according
to the scheme developed in the preceding sections, the dynamical system reflecting
Diophantine properties of $Y$ must come from the group stabilizing $\Pi$, that is, the
full diagonal subgroup $D$ of $SL_k(\mathbb{R})$. Thus the problems rooted in multiplicative
Diophantine approximation bring us to higher rank actions on $\Omega$ (in implicit form
this was already noticed in the paper [CS] of Cassels and Swinnerton-Dyer). More
precisely, one can state a multiplicative version of generalized Dani's correspondence
(Theorem 3.8), relating multiplicative Diophantine properties of $Y$ to orbits
of the form $\{g L_Y \mathbb{Z}^k \mid g \in D_+\}$ where $D_+$ is a certain open chamber in the group
$D$. (For a version of such a correspondence see [KM3, Theorem 9.2].)

We illustrate this principle by two examples below, where for the sake of simplicity
of exposition we specialize to the case $m = 1$ (one linear form $\mathbf{q} \mapsto \mathbf{y} \cdot \mathbf{q}$, $\mathbf{y} \in \mathbb{R}^n$),
setting $k = n + 1$. We will need the following notation: for $\mathbf{t} = (t_1, \dots, t_n) \in \mathbb{R}^n$
let us denote $\sum_{i=1}^n t_i$ by $t$ and define

$$g_{\mathbf{t}} = \mathrm{diag}(e^t, e^{-t_1}, \dots, e^{-t_n}) \in SL_k(\mathbb{R}). \tag{5.1}$$

Littlewood's conjecture. One can show that $\mathbf{y} \in \mathbb{R}^n$ does not satisfy (1.1) (that is, it
is BMA) iff the trajectory $\{g_{\mathbf{t}} L_{\mathbf{y}} \mathbb{Z}^k \mid \mathbf{t} \in \mathbb{R}^n_+\}$ is bounded in the space of lattices
in $\mathbb{R}^k$. Thus Conjecture 1.2 is equivalent to the statement that every trajectory as
above is unbounded. In fact, an argument rooted in the "Isolation Theorem" of
Cassels and Swinnerton-Dyer [CS, LW, Ma4] shows that the latter statement can
be reduced to the following

Conjecture 5.1. Let $D$ be the subgroup of diagonal matrices in $SL_k(\mathbb{R})$, $k \ge 3$.
Then any relatively compact orbit $D\Lambda$, $\Lambda$ a lattice in $\mathbb{R}^k$, is compact.

Notice that Theorem 3.5 shows that the above statement does not hold if k = 2. 
This highlights the difference between rank-one and higher rank dynamics. Note 
also the similarity between Corollary 2.6 and Conjecture 5.1, showing that higher 
rank hyperbolic actions share some features with unipotent dynamics. In fact, 
Conjecture 5.1 is a special case of a more general hypothesis, see [Ma8, Conjecture 
1], which, roughly speaking, says that for a connected Lie group $G$, a lattice $\Gamma \subset G$
and a closed subgroup $H$ of $G$, any orbit closure $\overline{Hx}$, $x \in G/\Gamma$, is an orbit of an
intermediate subgroup $L \supset H$ of $G$ unless "it has a good reason not to" (the latter
reasons must be coming from certain one-parameter quotient actions). See [Ma8, 
§1] for more detail. 

Multiplicative approximation on manifolds. Since every VWA vector is VWMA
(that is, $\psi_\varepsilon$-MA for some $\varepsilon > 0$) but not the other way around, it is a more difficult
problem to prove that a generic point on a nondegenerate manifold is not
very well multiplicatively approximable. This has been known as Conjecture $H_2$ of
Sprindzuk [Sp4]; the polynomial special case (that is, a multiplicative strengthening
of Mahler's problem) was conjectured by Baker in [B2]; both conjectures stood
open, except for low-dimensional special cases, until [KM2] where the following was
proved:

Theorem 5.2. Let $M$ be a nondegenerate smooth submanifold of $\mathbb{R}^n$. Then almost
all points of $M$ are not VWMA.

The strategy of the proof of Theorem 4.1 applies with minor changes. One
shows (see [KM2, Lemma 2.1 and Corollary 2.2] for a partial result) that $\mathbf{y} \in \mathbb{R}^n$
is VWMA iff for some $\gamma > 0$ there are infinitely many $\mathbf{t} \in \mathbb{Z}^n_+$ such that

$$g_{\mathbf{t}} L_{\mathbf{y}} \mathbb{Z}^k \in \Omega_{e^{-\gamma t}} \tag{5.2}$$

(here, as defined above, $t$ stands for $\sum_{i=1}^n t_i$). Therefore it is enough to use Theorem
4.6 to prove a modification of the measure estimate of Theorem 4.4 with $g_t$ as in
(4.4) replaced by $g_{\mathbf{t}}$ as in (5.1).

Finally let us mention a multiplicative version of Theorem 4.2, proved in [BKM] 
by a modification of the method described above: 

Theorem 5.3. Let $M$ be a nondegenerate smooth submanifold of $\mathbb{R}^n$ and let $\psi$
be such that $\sum_{l=1}^\infty (\log l)^{n-1}\, \psi(l)$ is finite. Then almost all points of $M$ are not
$\psi$-multiplicatively approximable.